Date Sequences and Missingness

Per a CDC/GTRI meeting on 10/22/2020, we decided to examine the dates in the HL7 data more closely for the purpose of synthetic data generation. Generating dates requires special care because certain logical orderings and relationships may need to be preserved in the synthetic data (e.g., a patient cannot be discharged from the hospital before being admitted).

The notebook explores the following:

Overall Missingness

Missingness by Condition

Missingness by Jurisdiction

Missingness by Condition and Jurisdiction

Missingness by Jurisdiction and Condition

Date Sequences

For any pair of date variables (a, b), whenever both a and b are present, is a < b?
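
The two kinds of checks listed above can be sketched on a toy example. This is only an illustration under stated assumptions: the records, the column names (`admit_dt`, `discharge_dt`, `death_dt`), and the helper functions `missing_fraction` and `order_summary` are hypothetical and are not taken from the HL7 data or from the notebook's actual code. It uses only the Python standard library and represents a missing date as `None`.

```python
from datetime import date
from itertools import permutations

# Hypothetical records: column names and values are illustrative only,
# not taken from the HL7 data. None marks a missing date.
records = [
    {"admit_dt": date(2020, 3, 1), "discharge_dt": date(2020, 3, 5), "death_dt": None},
    {"admit_dt": date(2020, 4, 2), "discharge_dt": None, "death_dt": None},
    {"admit_dt": date(2020, 5, 9), "discharge_dt": date(2020, 5, 7), "death_dt": None},
]
date_cols = ["admit_dt", "discharge_dt", "death_dt"]


def missing_fraction(rows, col):
    """Fraction of rows in which `col` is missing (None)."""
    return sum(r[col] is None for r in rows) / len(rows)


def order_summary(rows, a, b):
    """Count rows where both dates are present, and how many of those satisfy a < b."""
    both = [(r[a], r[b]) for r in rows if r[a] is not None and r[b] is not None]
    return {"both_present": len(both), "a_before_b": sum(x < y for x, y in both)}


# Overall missingness per date column.
for col in date_cols:
    print(col, "missing:", missing_fraction(records, col))

# Ordering check for every ordered pair (a, b) with at least one complete row.
for a, b in permutations(date_cols, 2):
    summary = order_summary(records, a, b)
    if summary["both_present"]:
        print(a, "<", b, ":", summary["a_before_b"], "of", summary["both_present"])
```

Missingness by condition or jurisdiction would apply the same `missing_fraction` call to the subset of rows in each group. A pair such as (`admit_dt`, `discharge_dt`) where a < b holds in nearly every complete row is a candidate ordering constraint for the synthetic generator; the third toy record, with a discharge before the admission, is the kind of violation the check is meant to surface.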